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ABSTRACT 


This thesis presents a software algorithm that resends 
work packages to processors when one or more of the worker 
processors fails or when the link with one or more processors 
fails. There are two resend criteria used in this algorithm: 
"resend at end of initial assignment" and "resend at time 
out." The work, divided into several packages in order to run 
on several processors in parallel, will be completed as long 
as at least one worker processor remains working and 
communicating with the main processor. This algorithm could 
add some fault-tolerance to computer processing equipment in 


embedded systems. 
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THESIS DISCLAIMER 


The reader is cautioned that computer programs developed 
in this research may not have been exercised for all cases of 
interest. While every effort has been made within the time 
available to ensure that the programs are free of 
computational and logic errors, they cannot be considered 
validated. Any application of these programs without 
additional verification is at the risk of the user. 

Many terms used in this thesis are registered trademarks 
of commercial products. Rather than attempting to cite each 
individual occurrence of a trademark, all registered 
trademarks appearing in this thesis are listed below the firm 
holding the trademark: 

INMOS Limited, Bristol, United Kingdom: 

Transputer 

OCCAM 

T414 

ES O.0 

Transputer Development System (TDS) 
3L Limited, Livingston, United Kingdom: 

Parallel C 

Parallel Fortran 
United States Government, Ada Joint Program Office, 
Washington, Dc: 

Ada 


ParaSoft Corporation, Mission Viejo, CA: 
Express 


Microsoft Corporation, Redmond, WA: 
MS 


Bell Laboratories, New York, NY: 
UNIX 
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Digital Equipment Corporation, Maynard, MA: 
VMS 


Apple Computer, Cupertino, CA: 
Macintosh 
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I. INTRODUCTION 


A. BACKGROUND 
1. Shipboard Systems 

In today's combat systems, the ability to continue to 
operate with hardware failures is becoming more and more 
important. Today's shipboard computer systems require at best 
an automatic reload of the surviving system when a hardware 
failure occurs; at worst the system is completely inoperable. 
Some systems require a manual reload of the surviving system. 
mpieeor this affects the ability to “fight hurt." A computer 
system that would automatically stop using processors that are 
not processing (for whatever reason), but still continue to 
process all needed functions and operations would greatly 
increase the ability of a ship to "fight hurt." 

Current U.S. Navy systems using multiprocessors 
include: Aegis and all shipboard NTDS systems. In the Aegis 
system, there are four processors in the computer system. When 
one of these fails, the system automatically reloads the 
remaining three processors with software that has a reduced 
capability (primarily due to reduced available memory). This 
reload occurs in avery short time and thus the most important 
functions are peered with little user inconvenience. The 


major draw back is that it depends on the reload system to 


function properly. Thus some ability of the computer system 
to “fight hurt" is there if the reload capability is Siu 
functioning when a processor quits. 

Most shipboard NTDS systems have even less capability 
in this area. Most have three processors. When one of these 
quits, the remaining two must be manually reloaded with a 
reduced capable system. The time required for this manual 
reload is much longer than for the automatic reload used in 
the Aegis system. Thus the users are unable to function during 
the reload. This means that any ability to "fight hurt", where 
these NTDS systems are concerned, is dependent on the manual 
reload time as well as the proper functioning of the reload 
SYSten. 

Many U.S. Navy systems use only one processor in their 
computer systems. These have no ability to continue 
functioning when the processor fails. The ability of a ship 
to "fight hurt" would be enhanced if these systems had the 
ability to function when a processor quits. 

A link failure between processors or between systems, 
in any of the current shipboard systems, means the loss of the 
functional use of at least one processor. 

2. Fault-Tolerant Literature 

The computer literature that discusses computer systems 
that can stop using processors refers to fault-tolerant 
computing or graceful degradation. Fault-tolerance deals with 


handling faults by restoring either full or reduced 


capability. Graceful degradation deals generally with reduced 
capability. When a computer system has a graceful degradation 
capability, its: 

- downtime for repair is short 

- uninterrupted operation is longer 

- unavailability periods are short 


- overall processing power is not seriously affected by 
failures (Ref. 1]. 


In a multiprocessor system graceful degradation can be 
achieved by: 
- Reconfiguring the system: switching out faulty hardware 
(processors, etc.) or software (modules, tasks, etc.) 


and switching in assumed good hardware or software. 


- Masking the failed item: not using faulty hardware or 
software. 


Reconfiguration can restore full or degraded capability. 
Masking will usually result in degraded capability in some 
may. [Ref. 2) 

Reconfiguration of hardware can be done with a 
dedicated reconfiguration processor or each processor can 
preform part of the overall reconfiguration work. Another 
approach is to use a reconfiguring process running on each 
processor. This process runs in turn on each functioning 
processor using a "check list" to step through the recovery 
process. These approaches require normal processing to be 
stopped during the reconfiguring process.[Ref. 3] 

The processor network used by the above approaches is 


not specified. The network may or may not have spare 


processors that can be switched to when a processor fails. A 
matrix approach with one spare row and one spare column of 
processors gives a great deal of flexibility for replacing 
failed processor [Ref. 4]. 
B. PROBLEM STATEMENT 

Any computer system using more than one processor can have 
a fault-tolerant feature. To have a computer system continue 
to process when one or more of its processors stops (given 
some processors continue to process), is the problem tain 
thesis looks at. 
The objective of this thesis: 

- While a Mandelbrot Program 1s running on a 
multiprocessor system, disconnect the link between some 
processors and have the system complete the Mandelbrot 


picture, even when only one processor capable of 
PUNE 


C. THESIS OVERVIEW 

This thesis is presented in four Chapters and five 
Appendixes. 

Chapter I was the introduction to the problem. Chapter 
II describes the system and the language. Chapter III 
addresses the method used in reaching the objective, the 
relative speeds of processing with different numbers of 
processing processors. Chapter IV states the conclusions and 


suggests further research. 


II. TRANSPUTERS: PROGRAMMING LANGUAGES AND 


CAPABILITIES 


A. PROGRAMMING LANGUAGES 
1. OCCAM 

OCCAM is a high level programming language that is 
designed to run concurrent processes on a network of 
transputers. There are two prime concepts in OCCAM, they are: 
concurrency and communication. These allow processes to run 
simultaneously and transfer information, via channels, from 
process to process. It is based on concepts founded by David 
May in Experimental Programming Language and by Tony Hoare in 
Communicating Sequential Processes. [Ref. 5] 

It allows processes running on a transputer system to 
communicate only through channels. These channels’ are 
asynchronous, but require the send and receive processes to 
be ready to send and receive at the same time. This idea of 
being ready to send and receive simultaneously is Known as 
rendezvous. (Ada also has a rendezvous feature.) 

OCCAM has six kinds of constructions that are used to 
build a process from smaller processes (primitive or other). 
These constructions are: 

- IF: This construction guards a number of processes by 


a boolean expression. It is similar to other languages' 
IF statement. 


- CASE: This construction is used to select one of a 
number of options. It is similar to the Pascal CASE 
statement. 


- WHILE: This construction is used for loops. Again it is 
similar to the Pascal WHILE statement. 


- PAR: This construction is used to parallel processes. 
Processes under this construction are done at the same 
time. 


- ALT: This construction is used to combine processes that 
are guarded. It selects one of the processes whose guard 
is in the true state. 


This language allows the programmer to concentrate on. 


a small, manageable set of processes which can then be 
connected with other sets of processes. In OCCAM a set of 
processes or a set of interconnected processes can be regarded 
as a single process. [{Ref. 5] 

The above features make OCCAM a powerful and versatile 
language. It hasn't gained wide acceptance thus far probably 
due to the limited use of multiprocessor (transputer) systems 
and due to the development of parallel versions of other 
widely used languages. 

2. Parallel C 

Parallel C is another high level programming language 
that was developed for use on a system of transputers 
(Ref. 6, pp. xij. It is based on the abstract model that 
requires communication of sequential processes via 
communication channels or ports [Ref. 6, pp.32]. 

A program can consist of one or more tasks with all 


tasks executing simultaneously. Each task has its own area of 


memory for instructions and data along with a vector of input 
ports and a vector of output ports. This allows a task to be 
on a transputer by itself or with other tasks. (Each task is 
compiled and linked separately.) The tasks communicate via the 
ports which make each task able to function as a software 
black box. [Ref. 6, pp.32-33)] 

This allows processes that can be done in parallel to 
be written in different tasks and run on separate transputers 
simultaneously. This means that if a problem can be done in 
many pieces simultaneously, the tasks to do these pieces can 
be placed on several transputers and executed in parallel. 
This reduces the overall computation time. 

The software black box (tasks) and the hardware 
(transputers) are tied together with a program called the 
configurer. This program connects the ports of the various 
tasks and assigns each task to a transputer in a designated 
eyotem. fRef. 6, pp.35-36] 

The Parallel C described above provides a convenient 
way of using transputers without the need to learn OCCAM and 
its associated editor and Transputer Development Systen. 

Other Concurrent C languages have been developed for 
various multiprocessor systems [Ref. 7]. Unfortunately, since 
all concurrent or parallel C languages are a super set of C, 
the only commonality between these concurrent or parallel C 


languages is the underlying C language. 


3. Parallel Fortran 

Parallel Fortran, like Parallel C, was developed for 
use on a system of transputers (Ref. 8, pp. xviii]. There are 
some differences, however, between Parallel Fortran and 
Parallel C beyond the obvious differences between Fortran and 
C. Parallel Fortran has an ALT function that is similar in 
operation to the ALT in OCCAM [Ref. 8, pp.60]. Furthermore, 
Parallel Fortran has a set of Bit-Manipulation Functions [(Ref.. 
8, pp.426). It also has a set of ANSI Standard Intrinsic 
Functions for Fortran 77 that give Fortran its abilities for 
use in mathematic and scientific applications. 

Parallel Fortran is used much like Parallel C in that it 
uses processes that communicate only over channels and the 
processes can be run on one or more transputers 
(Ref. 8, pp. 33-35]. It uses tasks in much the same way as 
Parallel C, and even its configurer is nearly identical to 
Parallel ¢*s. (Ref. 635. po. 33—4 mn 

4. Ada 

Ada is DOD's high level programming language that is 
required for all imbedded systems contracted for after June 
1984 (Ref. 9]. When an Ada compiler becomes available for the 
Inmos transputer, research using the combination of Ada and 
the transputer can begin. With Ada on a transputer system, a 
true parallel processing use of the rendezvous feature can be 


incorporated in applications. 








An Ada compiler for the Inmos transputer is under 
development by Alsys, Inc. [Ref. 10]. It is expected to be 
available in October 1989 [Ref. 11]. This will be the first 
implementation of Ada for a multiprocessor system that does 


not have a shared memory system. 


B. EXPRESS 

Express is an operating system that runs under another 
operating system. This system allows a program to be 
parallelized and executed provided the proper compiler is on 
the host's operating system. Any compiler for use on the host 
operating system will work. The Express system is available 
for MS-DOS, UNIX, VMS and Macintosh operating systems. It 
provides facilities for: 


- generating runtime parameters that allows programs to 
adapt at runtime to the transputer system environment 


- automatic mapping of a large image into smaller images 
for individual processors 


- interface of the I/O system with the various processors 


- allowing global accumulation of data in the transputer 
system. [Ref. 12, pp.3-7] 


C. CAPABILITIES 

The Inmos transputers are a family of microprocessors 
measuring about 30mm by 30mm by 4mm [Ref. 13, pp. 47}. The 
major components are: processor, four link engines, timer, 


2K bytes RAM and external memory interface. (The T800 has 


4K bytes RAM vice 2K bytes and has a floating point unit.) 
These components are connected by an internal bus. Both the 
T800 and the T414 are 32 bit devices. The T212 is a 16 bit 
device. [Ref. 14, pp. 18, 194] These are in essence a 
computer on a chip with limited memory. The on chip memory 
has a very short access time. The external memory interface 
can address up to 4G bytes of off chip memory allowing the 
transputer to access more memory than many large computer 
systems in use today. 

Prior theses [Ref 14, 15] document a detailed description 
of the major transputer components and some detailed tests and 
analysis of transputer performance. Of these performance test 
results, the affects that communication links have on 
processor performance and visa versa are remarkable. The 
packet size effects the processor performance greatly. 
Continually communicating with small packets (less than 100 
bytes per packet) reduces processor performance greatly. 
Conversely, uSing large packets (1000 to 100,000 bytes) only 
reduce the processor performance by 25% at most. This test was 
done while communicating on all four links as quickly as the 
link system will send/receive the packets. The affect that the 
processor has on communications link performance was not 
Significant. [Ref. 15, pp.38-44] This seems to indicate that 
links have priority over the processor on the internal bus. 
This would keep processors from waiting on information to 


process. 
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D. CONCURRENT FAULT-TOLERANT TRANSPUTER RESEARCH 

A concurrent research project is in progress involving 
fault-tolerant transputer systems. This concurrent research 
involves the use of a transputer system whose links are 
connected through two crossbar switches. The links can be 
removed andag/or inserted while the system is running. When a 
link is removed, the system looks for another link between 
the transputers that need to communicate and sets the crossbar 
switches to facilitate the needed communication. Thus, the 
fixed communication links are able to be replaced with 
dynamically assigned links for direct communications between 
transputers. These crossbar switches are controlled by a 
Single transputer that takes communication requirements from 


each of the other transputers in the system. [Ref. 16] 
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IIIT. FAULT-TOLERANT SOFTWARE ALGORITHM 


A. BACKGROUND 
1. Fault-Tolerant Literature Leading to the Algorithm 
The fault-tolerance literature discusses many 
approaches. The masking technique mentioned earlier is user 
transparent and quick. The masking technique here implies 
static (no hardware switching) fault-recovery. However, a 
static technique can be employed while a dynamic technique is 
Switching hardware or software. [Ref. 2] 
2. The Processes Without the Algorithm 
The Parallel C compiler contained a demonstration 
program (Mandelbrot) that simply sent packages of work out to 
worker transputers. the worker transputers sent back the 
assigned package results as they completed the work on each 
package, see Figure 1. The order of sending started with 
package number 00 and proceeded in order through 99 as shown 
in Figure 2. The order of receiving depends on the amount of 


work to complete the assigned package calculations. [Ref. 6] 
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B. THE ALGORITHM 
1. The Idea 
While experimenting with the Parallel C compiler and 
the Mandelbrot demonstration program, two questions came to 
mind: 


- what happens when communication is lost with one or more 
worker transputer 


- can the remaining worker transputers pick up the work 
for those that have lost communication? 
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Figure 1. SOFTWARE ORGANIZATION 


This is one method of masking as the article "Redundant Parts 
Keep Systems Running" [Ref. 2] discusses. 
If the work can be done in spite of lost workers, 


imbedded systems could use this algorithm as a part of a 


fault-tolerant system. 


16. 












rope Te] ] os] so] | | 
or [aa Pas (oe fae [se os [99 [or | 
Pes [oe [oe [ae [oe | se [oe [oe fo | 
[oa 7a [as | 32 [ee ool 


Pol aol | 54 | 64 | 74 
ros [os tas | os ee oe 
Pos ae [es [5s [ss [se ee 76 | es | 96 | 


d ZY 


Figure 2. PATTERN DIVISION AND NUMBERING 


2. The Algorithm Details 

When communication is lost with one or more worker 
transputers, the results from the work packages already 
assigned to them is lost. Each of the worker transputers will 
have up to three work packages assigned to it at any time. The 
remaining worker transputers continued to process their 
previously assigned work while receiving and processing new 
work. Thus, even with only one working transputer the problem 
will be completed; however, there is a considerable time 
penalty. 

Thus, the main task was sending work packages and 
receiving results. All that needs to be done is to determine 
what results had not come back under some criteria. Two 
criteria are used in this thesis: 


- resend at end of initial assignment: this criteria waits 
to resend any work package until all packages have been 
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sent once and then resends those packages whose results 
are not back yet 


- resend at time out: this criteria resends a work package 
when a preset time has elapsed since it was first sent. 


The code in Appendix A has the Mandelbrot data 
structure with annotated modifications for the "resend at end 
of initial assignment" criteria. The code in Appendix B uses 
the data structure in Appendix A to implement the "resend at 
end of initial assignment" criteria. 

The "resend at time out" criteria data structure code 
is in Appendix C. It has additions to the code in Appendix A 
which allows the use of both criteria. Both criteria are 


implemented in the code in Appendix D. 


C. ALGORITHM TEST RESULTS 
1. Original Code 
The Mandelbrot code as supplied with the compiler, see 
Appendix E, was run on a system of 12 worker transputers 
(T414's). Figure 3 shows the system connections including the 
host PC and main transputer (a T414). The time to complete a 
Mandelbrot set averaged 108.6 seconds. 
2. Adding the "resend at end of initial assignment" 
criteria 
Next the Mandelbrot code was run with the "resend at 
end of initial assignment" criteria. Running the _ same 
Mandelbrot set as previously run on the system shown in 


Figure 3, the average time to complete the set was 109.6 


15 


seconds. Analysis of variance between the times with and 
without the "resend at end of initial assignment" criteria 
shows no significant difference at the .05 level of 
significance. Thus the new code doesn't significantly impact 


the computing time of this process. 
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Figure 3. SYSTEM CONNECTIONS 


The fault-tolerant feature of this code was next timed. 
The system started out with the connections shown in Figure 
3. While the Mandelbrot set was running, the link to four of 
the worker transputers was disconnected as shown in Figure 4. 
Table I shows the average time to complete when the link was 


disconnected at times shown. 
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3. Adding the "resend at time out" criteria 
Here, the "resend at time out" criteria was added to 
the original code with the "resend at end of initial 
assignment" criteria (running both criteria). The average time 
to complete the same Mandelbrot set as previously run, using 
the system shown in Figure 3, with different timer delays is 


as shown in Table II. 
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Figure 4. SYSTEM CONNECTIONS AFTER DISCONNECT 


Analysis of variance between the original code time 
and the code with both criteria shows that all are 


Significantly different at the .05 level of significance. 
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Since the 21 and 41 second delay times take longer to complete 


the problem than the original code, they are of less use than 


Table I. DISCONNECT TIME VS TIME TO COMPLETE 


LINK DISCONNECT TIME TOTAL TIME 
TO 

IN SECONDS AFTER START COMPLETE IN 
SECONDS 


20 142.0 


40 143.0 


60 128.0 


80 gage) 10) 





Table II. DELAY TIME VS TIME TO COMPLETE 


DELAY TIME AVERAGE TIME TO 
IN SECONDS COMPLETE IN SECONDS 


21 185.4 


41 Ih ENO) 3 


50 9 oan 





the 50 second delay. The 50 second delay took less time to 
complete than the original code. This would, with the 50 
second delay, indicate better use of the processors for 
completing the work. 

The fault-tolerant feature of this code was next timed. 


As with the test of the "resend at end of initial assignment" 


FS 


The fault-tolerant feature of this code was next timed. 
As with the test of the "resend at end of initial assignment" 
criteria, the link to four of the worker transputers was 
disconnected approximately 20 seconds after the Mandelbrot set 
was started. The average times to complete are shown in Table 
III. The average time to complete for 40, 60 and 80 second 
link disconnect times are shown in Tables IV, V and VI 


respectively. 


Table III. DELAY TIME VS TIME TO COMPLETE 
USING FAULT-TOLERANT FEATURE 
WITH 20 SECOND LINK DISCONNECT TIME 


DELAY TIME AVERAGE TIME TO 
IN SECONDS COMP@E LE. IN sECONDS 


Pah 232.4 
41 1/7 12.0 


ae, 166.2 





Table IV. DELAY TIME VS TIME TO COMPLETE 
USING FAULT-TOLERANT FEATURE 
WITH 40 SECOND LINK DISCONNECT TIME 


DELAY TIME AVERAGE TIME TO 
IN SECONDS COMPLETE IN SECONDS 


2 301.6 


41 Z2oee 


50 208.0 
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Table V. DELAY TIME VS TIME TO COMPLETE 


USING FAULT-TOLERANT FEATURE 
WITH 60 SECOND LINK DISCONNECT TIME 


DELAY TIME AVERAGE TIME TO 
IN SECONDS COMPLETE IN SECONDS 


at 293.6 
41 203.6 


50 164.4 





Table VI. DELAY TIME VS TIME TO COMPLETE 


USING FAULT-TOLERANT FEATURE 
WITH 80 SECOND LINK DISCONNECT TIME 


DELAY TIME AVERAGE TIME TO 
IN SECONDS COMPLETE IN SECONDS 


eu 274.2 


41 182.4 


50 148.0 





4. 


Results Summary 


The "resend at end of initial assignment" criteria 


no significant effect on the running of the program when 


transputers remained connected. The program will complete 


problem even though communication is lost with some of 


worker transputers. When both criteria are used, the 21 


had 
dt IL 
the 
the 


and 


41 second delays take longer than the original code; however, 


the 50 second delay takes less time than the original code. 


The fault-tolerant feature takes longer than the "resend at 


end of initial assignment" criteria. Further time tests are 


needed to find the best time to use. 
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IV. CONCLUSIONS AND FURTHER RESEARCH PROPOSALS 


A. CONCLUSIONS 

The INMOS transputers lend themselves readily to a fault- 
tolerant system. Here an application was made fault-tolerant 
by adding code to existing code. The two criteria for 
determining when an assumed fault has occurred can be used 
together. For practical use the individual application may 
need both or only one or the other. For example a system that 
never completes initial assignment, because it continually has 
work to assign, would not be able to use the "resend at end 
of initial assignment" criteria. 

This algorithm could be useful in any embedded system. 
These systems have several functions or tasks (herein called 
worker) that are done repeatedly. By using more than one 
processor to do a task, many tasks can become fault-tolerant 
using this algorithm along with appropriate resend criteria. 
Thus, the loss of communication with a processor or processors 
(either processor failure or link failure or both) would not 
stop a task from functioning. By making tasks fault-tolerant, 
the entire system becomes more fault-tolerant. 

Deadlines are very important in many embedded systems. 
Multiprocessors might be used in some areas in order to meet 


the required deadlines. If this is the case, adding additional 
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processors for fault-tolerance may be necessary to ensure that 


the deadlines can be met even with a failed processor. 


B. FURTHER RESEARCH 

Research in the area of fault-tolerance on transputer 
systems is a broad area. The software approach taken in this 
thesis is one of several possible approaches. The algorithm 
presented herein is simple and appears not to increase problem 
completion time. Further research is needed in this area and 
on this algorithm to be able to apply it to a practical 
system. 

Some recommended areas for further research are: 

- develop an algorithm for determining the best time delay 
for use in the "resend at time out" criteria that can 
be used for any process using a similar system 

- determine the affect of changing the time delay--used 
in the "resend at time out" criteria--to time from when 
a packet was last sent vice from when a packet was first 
sent 

- develop an algorithm that will use reconnected 
transputers that were disconnected during processing, 
thus allowing recovery while the process is running 
(dynamic recovery) 

- integrate this thesis with the one entitled Dynamic 
Reconfiguration and Link Fault-Tolerance in a Network 
of Transputers [Ref. 16], implementing dynamic recovery. 
Continued research involving fault-tolerant systems of 


transputers should yield computer system that are especially 


useful in embedded systems. 
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APPENDIX A 


DATA STRUCTURE HEADER FILE FOR MANDELM2.C CODE 


/*** MANDELTY.H' 
xkkx 


kk Parallel Mandelbrots 

kee 

kee These are the formats of the packets used to 
ka communicate between the master task and the 
kkk computation tasks. 

kik 

kak Rev 000 6-Dec-87 JF Created 

kik 

zee / 


typedef struct command structure { 
float x coord, y coord, gap; 
int eiEx- tiy, brx, bry; 

} COMMAND; 


typedef struct results structure { 
int PiGeetly, brx, bry; 
char counts[1008]; 

} RESULTS; 


/* Modified by W. Benage on 21 Mar 1989 */ 

/* This SAVE structure is used with MANDELM2.C code for */ 
/* fault tolerance. The fault tolerance for this code is */ 
/* for a link failure connecting work processors or */ 

/* connecting the work processors with the root processor. */ 
/* This alogrithm is not based on time but on what work *#/ 
/* has not returned complete. #*/ 


typedef struct save structure { 


int tlx, tly, brx, bry, returned; 
} SAVE; 


‘Reprinted with permission of 3L Ltd. 
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APPENDIX B 


CODE FOR "RESEND AT END OF INITIAL ASSIGNMENT" CRITERIA 


[kX 
xk**kk 
kkk 
kkk 
*x**k* 
kk 
x*k* 
kkk 
xk 
kKk* 
kKk* 
kk 
**k* 
kk 
kk* 
kx 
xk* 
Kx x 
kk* 
kkk 
kx*kx* 
**x 
kk 
kkxk 
kk 
kkk 
**x 
*x**k* 
xkKxk* 
kk 
kKk* 
kk 
kk 
**k* 
kkk 
kkx* 
kkx 
kx 
kkx* 
x**k* 
kkk 


MANDELM. C* 
Copyright (c) 1988 3L Ltda 


Example program: Mandelbrot set evaluation and 
display. NB: This application requires a Colour 
Graphics Adaptor. 


The application 


The application consists of two tasks: 


(1) MANDELM (this file). This is the master task, and 
runs in the root transputer. 

(2) MANDELW. This is the worker task, and runs in all 
the other transputers of the net. 


The flood configurer, FCONFIG, can be used to produce 
an executable file which will automatically 
distribute the worker tasks across an arbitrary 
network and route work packets from the master to the 
workers. 


It 1s also possible to run the application in a 
Single transputer. This will work automatically if 
the application is configured using FCONFIG. 
Alternatively, a static single-transputer 
configuration could be built by hand, using CONFIG. A 
suitable configuration file may be found in 
MANDEL.CFG. 


As well as various routines from the Parallel C 
run-time library, MANDELM must be linked with the CGA 
primitives module, CGA.BIN. A file MANDELM.LNK is 
supplied, which may be used to link MANDELM, like 
this: 


LINKT @MANDELM. LNK, MANDELM.B4 


Functions of the tasks 


‘Reprinted with permission. 
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eX 
kkk 
k** 
k*e* 
kak 
k** 
kkk 
k** 
kkk 
kK*X 
kX 
kk 
eX 
kek 
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kkk 
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kak 
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KX 
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kek 
kkk 
k** 
kkk 
kkk 
kkk 
kkk 


RaK / 


MANDELM is told by the user which part of the 
Mandelbrot set to evaluate. It then breaks this up 
into 100 packets, and sends them to the network of 
MANDELW's. As the results from each return, they 
are displayed on the PC's screen. 


Internals of MANDELM 


The task contains three threads. 


(1) The MAIN thread. 

This runs in the function main(). It intialises the 
other two threads and then goes into a loop, once 
round for each Mandelbrot display. For each, it gets 
instructions from the user, and then signals the SEND 
thread to start work by using the 
parameters are ready semaphore. It keeps track of 
completed work by examining tally done, which is 
incremented by RECEIVE everytime a RESULTS packet is 
displayed; whenever it notices that tally done has 
changed, it updates the PC's display; and when 

tally done reaches 100, MAIN knows that the display 
1s complete. 


(2) The SEND thread. 

This knows when to start work by examining the 
parameters are ready semaphore. It then breaks the 
job into 100 small jobs, places the details into a 
COMMAND structure (defined in file MANDEL.H) and uses 
the net _send function to send it off to the network 
of MANDELW's. Notice the SEND does not specify WHICH 
worker task is to do any particular job; this is 
decided by the network of router tasks. 


(3) The RECEIVE thread. 

This simply waits till a packet arrives from the 
network of MANDELW's and then displays it. Each 
packet contains all the necessary information to 
display it, so RECEIVE does not need to keep track of 
which packet is which. Every time it does a display, 
RECEIVE increments tally done, so that MAIN can tell 
when the whole display is complete. 


Ver. 1.1 16-Dec-87 JF 


/* Modified for fault tolerance by W. Benage 21 Mar 1989 */ 
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/* All code not identified as fault-tolerant */ 
/* modifications is 3L Ltd. original code. */ 


#include <stdio.h> 
#include <dos.h> 
#include <thread.h> 
#include <sema.h> 
#include <par.h> 
#include <net.h> 
#include <timer.h> /*Used in restart logic only */ 
#include "cga.h" 
#include "mandel.h" /* Modified for resend using 
fault-tolerance. */ 
/* SAVE data type added. */ 


/* Interface to SEND thread */ 
static SEMA parameters are ready; 


/* Fault-tolerance data structure. */ 
static SAVE s[10}[10]; /* These integers in [] must match */ 


the divisors in #define * / 
X INCREMENT and #define * / 
Y INCREMENT lines. x / 


/* Interface to RECEIVE thread */ 
static int tally done; 


/* Fault-tolerance variable */ 
static int resend; 


/* Current Mandelbrot and display parameters */ 
static float x cocrd, y  cecrd mga, 
static int threshi, thresh2, thresh3; 


/* Define the way the job is broken into packets */ 
#define X INCREMENT ((CGA_LORES XMAX+1)/10) 

#define Y INCREMENT ((CGA_YMAX+1)/10) 

#define PACKETS 100 


#define D 15625 /* Number of low pri ticks in 1 sec. */ 
/* Used in fault-tolerant restart only */ 


/* 
* This function is invoked by MAIN using thread_create to 


* create the SEND thread. 
* 


of 
send () 


Dit i, i.e /* i, j varibles are added */ 
/* for fault-tolerance. a7 
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COMMAND fers 
mei (;;) { 


/* Wait here until MAIN signals it's okay to go ahead */ 
sema wait (&parameters are ready); 


/* Fill in the fixed parts of the command */ 
c.x_ coord = x_coord; 

c.y coord = y coord; 

c.gap = gap; 


/* Send off the packets to be done. Each includes the 
coordinates of the top-left and bottom-right 
corners of the area to do. This both tells the 
worker task what values to generate and identifies 
the RESULTS packet when it arrives in the RECEIVE 
thread (since there's no guarantee that the results 
will arrive in the same orderthe commands were sent 
out) 


ud 


1 = 0; 
BOER 90; x) (CGA LORES@XMAX; x += A ENCREMENT) ({ 
Coetwoex; —C.0bx = xX + X INCREMENT - 1; 
J = 0; 
for (y = 0; y <= CGA_YMAX; y += Y_INCREMENT) { 
c.tly = y; c.bry = y + Y INCREMENT - 1; 
/* Send off the next packet */ 
net send (sizeof(COMMAND), &c, 1); 
/* Save coordinates of next packet */ 


Siew =—aC.bix; “Sil {j].brx = c.brx; 
Sieegiecay — Craly;, S{(ij{}].bry = c.bry; 
j += 1; 

} 

1 += 1} 


} 
/* resend any packet not yet received back */ 
resend = 0; 
while (tally done < PACKETS) { 
for (1 = 0; i < 10; i += 1) { 
for (j = 0; 3 < 10; 3 += 1) { 
if (s{ij[j]).returned == 0) { 
Co elx, es ifs | sclx; c.tly 
c.brx = s{ij}[{j].brx; c.bry 
resend += 1; 
net send (sizeof(COMMAND), &c, 1); 


ott 
nm 
“om 
ps ps 
a 
oe ee 
Lando LJ 
aa) 

5 
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* This function is invoked by MAIN using thread create to 
* create the RECEIVE thread. 


a7 


receive () 
{ 
RESULTS r; 
BE gle len, ready, xX, Y, 1, J, n, colour, rtn, eoumee 
/* j, xrtn, count are added for fault-tolerance */ 


count = 0; 
sep eee Ghia rR || 


/* Thread will wait here till a packet arrives */ 
len=net receive (&r, &ready); 


count += 1; 
if (tally done < PACKETS) { 


/* Lets the fault-tolerant SAVE system know when a packet 
has been received. The 1 is also used here for fault- 
EOLeranceca 


Fen: = 0; 
for (i.= 07 39-710) ee eee 
for?(j = 03 7) < 20a — 
if (s{iJ(j]-tlx == r.tlx && s[ij][(j}.tly == 
r.tly && s[{ij[j]-brx == r.brx && 
Sfij[}j).bry == -orEy 
s[{i})(j]-returned = 1; 
if (s{ij[{j]-.returned == 1) 
Yrtn += 1 


} 
1 2=0- 


/* The results packet includes the coordinates of the 
top-left and bottom-right corners of the data, so we 
know where to display it. 


oe 


for (y=r.tly; y<=r.bry; yt+t+) { 
for (x=r.tlxX; X<="5.DEx 78 een 
n = r.counts[i+t]; 
/* Received O means 1, etc. */ 


Pans: 


n += 1; 
/* Decide on the colour <- thresholds, and display */ 
colour = (n>=threshl) + (n>=thresh2) + 
(n>=thresh3) ; 
cga_lores plot (x, y, colour); 


} 


/* Update the tally of packets displayed. */ 
tally done = rtn; 


/* The fault-tolerant system accounts for all packets 
sent. */ 
else { 
tally done = count; 
/* Reset count when all packets are returned. */ 
if ((count - PACKETS) == resend) 
count = 0; 


} 


* 


* The MAIN thread runs here 
* 


a7 


main () 


{ 


float range; 
mzeeprevious tally, i, J; 
/* 1, j} are added for fault-tolerance. */ 


/* Make sure we have text mode (and clear screen), then sign 
on */ 


video mode (MONO 80COL TEXT MODE) ; 

printf ("\nCopyright (c) 1988 3L Ltd\n\n"); 

printf ("Example program: Mandelbrot set evaluation and 
display\n"); 

printf ("NB: This program requires a Colour Graphics 
Adaptor\n\n") 


/* Initialise this SEMA to 0O BEFORE we start the SEND 
thread. This means it will wait until we tell it it's 
safe to go ahead */ 


sema_init (&parameters are ready, 0); 
/* Now start the other two threads */ 
thread create (send, 20000, 2,0,0); 
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thread create (receive, 20000782 j107 00", 
for (/7am{ 


/* Initialize s matrix for detecting packets not */ 
returned for fault-tolerance. */ 
for (1 = 07 i < 2037 += ae 
for (j = 0: 3 < 107 jue 
s(ij)(j3]-returned = 0; 


} 


/* This will ensure that no other threads are using 
the C run-time library (in fact, in this case they 
won't be, but I have done it here as an example...) 

+7, 

sema wait (&par_sema) ; 

printf ("\ninput xX coordinate. a. 

scant 9('sf "ex ncoome: 

printf (“Input Y coordanace:e.); 

SCant ("2i", sy coord), 

printf ("Input Y range: cL a 

scanf ("%f", &range) ; 

gap = range / (float) (CGA_YMAX+1) ; 

y_ coord = y coord + range; 





printf ("Threshold 1: "); scant ("a") éthreciiee 


printf ("Threshold 2: "); scant ("td") Gthwecmege 
printf ("Threshold 3: "); scanf ("td", S&threshae 
getchar (); /* Consume the final NL */ 


/* We have finished with the C RTL - release it */ 
sema_ signal (&par_sema) ; 


/* Into graphics (CGA low resolution) mode */ 
video mode (CGA _LORES GRAPHICS MODE) ; 


/* Before we set SEND going, reset the count of 
finished packets to zero - RECEIVE will count it 
back up 

“7, 

tally done = 0; 

/* All ready - set it going! */ 

sema_ Signal (&parameters are ready) ; 


previous tally = 0; 


/* Until all the packets have been done, just keep 


updating the display when necessary 
* 


while (tally done < PACKETS) { 
while (tally done==previous tally) { 
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/* Wait here till something happens. Use 
thread deschedule to save CPU time */ 
thread deschedule (); 


} 

/* Send the picture up to the PC's display memory */ 
cga_update (); 
previous tally = tally done; 

} 


/* This ensures that the PC display is up-to date. */ 
cga_update (); 


/* Once again, wait for the RTL to be free; then beep 
and wait till the user strikes any key */ 

sema_ wait(&par_sema) ; 

putchar ('\007'); 

getchar (); 

sema_ Signal (&par_sema) } 


/* Clear the screen and set text mode again */ 
video mode (MONO 80COL TEXT MODE) ; 


/* Print the number of packets resent to monitor fault- 


tolerant system. */ 

printf ("Resend = "); 
printf ("%d",resend) ; 
omnt cm \n!) 3 


/* Wait for all resent packets to return or */ 

/* if failure occured during processing, */ 

/* wait 2 min. * / 

1 = 0; 

while ((tally done < (PACKETS + resend)) && (i < 120)) 


timer delay (D); 


i += 1; 
thread deschedule (); 


Ak 


APPENDIX C 


DATA STRUCTURE HEADER FILE FOR MANDELM3.C CODE 


/*** MANDELTY.H? 
kik 


kaK Parallel Mandelbrots 

xxaxX 

kes These are the formats of the packets used to 
kaAK communicate between the master task and the 


Tas computation tasks. 

x** 

kK Rev 000 6-Dec-87 JF Created 
xx*exX 

exe / 


typedef struct commandes rr uceure | 
float xX coord, y coonemecap, 
ite Cixe tly .2brEee oa, 

} COMMAND; 


typedef struct résulvteiterriuccumes, 
Prt Gly <cly, tom Dry 
char Counts! 1006]; 

) RESULTS; 


/* Modified by W. Benage on 29 Mar 1989 */ 

/* This SAVE structure is used with MANDELM3.C code as */ 

/* modified for fault tolerance. The fault tolerance for */ 
/* this code is for a link failure connecting work */ 

/* processors or connecting the work processors with the */ 
/* root processor. This alogrithm is based on time and on */ 
/* what work has not returned complete. */ 


typedet struct Save sctruceures, 


int tlx, tly, brx, bry, returnedeaae me: 
1 SAVE? 


-Reprinted with permission. 
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APPENDIX D 


CODE FOR "RESEND AT TIME OUT" AND "RESEND AT END OF INITIAL 


[RX 
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ASSIGNMENT CRITERIA 
MANDELM.C" 
Copyright (c) 1988 3L Ltd 
Example program: Mandelbrot set evaluation and display. 
NB: This application requires a Colour Graphics 


Adaptor. 


The application 


The application consists of two tasks: 


(1) MANDELM (this file). This is the master task, 
and runs in the root transputer. 
(2) MANDELW. This is the worker task, and runs in 


all the other transputers of the net. 


The flood configurer, FCONFIG, can be used to produce 
an executable file which will automatically distribute 
the worker tasks across an arbitrary network and route 
work packets from the master to the workers. 


It is also possible to run the application ina single 
transputer. This will work automatically if the 
application is configured using FCONFIG. Alternatively, 
a static single-transputer configuration could be built 
by hand, using CONFIG. A suitable configuration file 
may be found in MANDEL.CFG. 


As well as various routines from the Parallel C 

run-time library, MANDELM must be linked with the CGA 

primitives module, CGA.BIN. A file MANDELM.LNK is 

supplied, which may be used to link MANDELM, like this: 
LINKT @MANDELM.LNK,MANDELM.B4 


Functions of the tasks 


ap @@ ea eo = 22 622 = & = 6 22 = = 2p = &® &® 6 = = =e 


“Reprinted with permission. 
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kK / 


MANDELM is told by the user which part of the 
Mandelbrot set to evaluate. It then breaks this up into 
100 packets, and sends them to the network of 
MANDELW's. As the results from each return, they 

are displayed on the PC's screen. 


Internals of MANDELM 


The task contains three threads. 


(1) The MAIN thread. 

This runs in the function main(). It intialises the 
other two threads and then goes into a loop, once round 
for each Mandelbrot display. For each, it gets 
instructions from the user, and then signals the SEND. 
thread to start work by using the parameters are ready 
semaphore. It keeps track of completed work by 
examining tally done, which is incremented by RECEIVE 
everytime a RESULTS packet is displayed; when- 

ever it notices that tally done has changed, it updates 
the PC's display; and when tally done reaches 100, MAIN 
knows that the display is complete. 


(2) The SEND thread. 

This knows when to start work by examining the 
parameters are ready semaphore. It then breaks the job 
into 100 small jobs, places the details into a COMMAND 
structure (defined in file MANDEL.H) and uses the 
net_send function to send it off to the network of 
MANDELW's. Notice the SEND does not specify WHICH 
worker task is to do any particular job; this is 
decided by the network of router tasks. 


(3) The RECEIVE thread. 

This simply waits till a packet arrives from the 
network of MANDELW's and then displays it. Each packet 
contains all the necessary information to display it, 
so RECEIVE does not need to keep track of which packet 
is which. Every time it does a display, RECEIVE 
increments tally done, so that MAIN can tell when the 
whole display is complete. 


Ver. 1.1 16-Dec-87 JF 


/* Modified for fault tolerance by W. Benage 29 Mar 1989 */ 
/* Lines not identified as added for fault-tolerance are */ 
/* original 3L Ltd. code. */ 


#include <stdio.h> 
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#include <dos.h> 

#include <thread.h> 

#include <sema.h> 

#include <par.h> 

#include <net.h> 

#include <timer.h> 

#include "cga.h" 

#include "mandel.h" /* Modified for fault-tolerance. */ 
/* SAVE data type added . */ 


/* Interface to SEND thread */ 
static SEMA parameters are ready; 


/* Fault-tolerance data structure (SAVE). */ 

static SAVE s[{10][10]; 
/* These integers in [{] must match the */ 
/* divisors in #define X_INCREMENT and */ 
/* #define Y INCREMENT lines. */ 


/* Interface to RECEIVE thread */ 
static int tally done; 


/*Fault-tolerant variables. */ 
static int resend; 
static int no updated, last_i, last j; 


/* Current Mandelbrot and display parameters */ 
Beaeac float x coord, y coord, gap; 
static int threshi, thresh2, thresh3; 


/* Define the way the job is broken into packets */ 
#define X INCREMENT ((CGA LORES XMAX+1) /10) 

#define Y INCREMENT ((CGA_ YMAX+1)/10) 

#define PACKETS 100 


#define D 15625 /* Number of low pri ticks in 1 sec. */ 
/* Part of fault-tolerant timer system. */ 


* This function is invoked by MAIN using thread create to 
* create the SEND thread. 


a / 
send () 
/* kK, 1, 1, J, and delay are integer variables added */ 
/* for fault-tolerance. */ 


sghe k, le i, au x, Y, delay; 
COMMAND C; 
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delay = 50 * D; 
/* Sets length of fault-tolerant delay in sec.*/ 
for (Caan 


/* Wait here until MAIN signals it's okay to go ahead */ 
sema wait (&parameters are ready); 


/* Fill in the fixed parts of the command */ 
c.x coord = x_coord; 

c.y coord = y coord; 

C.gap = gap; 


/* Send off the packets to be done. Each includes the 
coordinates of the top-left and bottom-right corners of 
the area to do. This both tells the worker task what values. 
to generate and identifies the RESULTS packet when it 
arrives in the RECEIVE thread (since there's-no guarantee 
that the results will arrive in the same order the commands 
were sent out) * / 


resend = 0; 
/* Initialized for fault-tolerant system. */ 
1 = 0; /* Initialized for fault-tolerant system. */ 
for (x = 0; x < CGA_LORES XMAX; x += X_INCREMENT) { 
c.tlx = x; c.brx = x + X_INCREMENT - 1; 
} = 0; /* Initialized for fault tolerant system. */ 


for (y = 0; y <= CGA_YMAX; y += Y_INCREMENT) { 
c.tly = y; ec.bry = y + Y INCREME 
/* Send off the next packet */ 
net send (sizeof(COMMAND), &c, 1); 

/* Save coordinates of next packet and time sent */ 


/* for fault-tolerant system. */ 

S(1)[(j].-time = timer _now (); 

s(ij(j).tlx = c.tlx; s{f{ij{q9).brx = clbme 
s[ij(j).tly = c.tly; s[ijfg)-bry = ese 
/* Resend any packet not received back in the */ 
/* "delay" time for fault-tolerance. * / 


for (k = 0; k <= i; k += 1) { 
for (1 = 0; 1 <= 3; 1 += 1) { 
if ((s(k)(1).returned == 0) && 

((timer now () - s{k][{1].time) > delay) ) { 
c.tlx = s[(k](1]).tlx; c.tly =—s[{k][{1].tly; 
c.brx = s{k)(1).brx; c.bry = s[k][1]).bry; 
resend += 1; 

net send (sizeof(COMMAND), &c, 1); 
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Ceelx = Ssfij(j)-tlx-? 
ev.brx = sfi)[j})-brx; 
ye += 1; 

1 += 1; 


/* resend any packet not yet received back for */ 
/* fault-tolerance */ 
while (tally done < PACKETS) { 
for (1. = O;7m! < 107-4 += 1) { 
for (j = 0; j < 10; j += 1) { 
if ((s[{i}){j].returned == 0) && 

((timer now () - s[{i}[j]-time) > delay)) { 
Coole —wclaietix; cotive= Stil[j]-tly; 
Copms = S(ii)]-brx; ¢c.bry s{ij(j]-.bry; 
resend += 1; 
net send (sizeof(COMMAND), &c, 1); 


} 
else 
thread deschedule (); 


/* 
* This function is invoked by MAIN using thread _ create to 
Bmereate the RECEIVE thread. 

* 

e/ 


receive () 


RESULTS r; 
f* j, n_inc, rtn, and count are added for the */ 


/* fault-tolerant system. */ 
int Metweseaay, X, VY, 2, 3, M, Mine, colour, rtn, 


count; 


count = 0; /* Initialized for fault-tolerant system. */ 
for (77) { 
/* Thread will wait here till a packet arrives */ 
len=net receive (&r, &ready) ; 


count += 1; 
/* Counts number of packets received back for */ 
/* for the fault-tolerant system. */ 


3 / 


/* "if" statement rejects all packets received after */ 
/* number 100. Part of fault-tolerant system. */ 
if (tally done < PACKETS) { 


/* The results packet includes the coordinates of the 
top-left and bottom-right corners of the data, so we 
know where to display it. 


i / 
rtn = 0; /* Initialized for fault-tolerant system */ 


/* Fault-tolerant code for determining what packets have */ 
/* returned. */ 


for (i = 0; i < 10; i t= 1) { 
for (j = 0; 3 < 10; j += 1) { 


if (s{ijJ({j].tlx == r.tlx && s[{i}[{j].tly == 
r.tly && s{i]J{j])-brx == r.brx && 
S[ij(j]-bry == r.bry) { 
if (s{ij]J{j])-returned == 0) { 


n_inc = 0; 
/* "for" loops using y and x are 3L Ltd. code */ 
for (y=r.tly; y<=r.bry; ytt+) { 
for (x=r.tlx; x<=r.bGx; 34 
n = r.counts [(nianer a 
/* Received 0 means 1, etc. */ 
n += 1; 
colour = (n>=threshl) + 
(n>=thresh2) + 
(n>=thresh3) ; 
cga_lores plot (x, y, colour); 
} 
} 
/* Remaining code in this thread is part of fault- */ 
/* tolerant system. */ 
no updated += 1; 


/* Let SAVE sys know when a packet has been received */ 
s{ij({j}-returned = 1; 
last_i = i; last_j = j; 
} 
if (s{i][(j].returned == 1) { 
rtn += 1; 
} 


} 


/* Update the tally of packets displayed */ 
tally done = rtn; 


/* Set i to zero for the 2 min. timer when tally done is */ 
/* equal to or greater than PACKETS. * / 
1 = 0; 
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) 


/* Account for all packets sent or wait 2 min. */ 


else { 
if (((count - PACKETS) <= resend) && (i < 120)) { 


tally done = count; 
1 += 1; 


/* Reset count when all packets are returned or 2 min */ 


/* have elapsed. * / 
if (((count - PACKETS) == resend) || (i >= 120)) { 


count = 0; 


) 


/* 
* The MAIN thread runs here 
* 


a 


main () 


{ 


float range; 
/* 1 and j are added for fault-tolerant system. */ 


int previous tally, i, j? 


/* Make sure we have text mode (and clear screen), then 
Sign on */ 


video mode (MONO 80COL TEXT MODE) ; 

prance ("\ncopyright (c) 1988 3L Ltd\n\n"'); 

printf ("Example program: Mandelbrot set evaluation and 
Gdisplay\n") ; 

printf ("NB: This program requires a Colour Graphics 
aaapcor\n\n") ; 


/* Initialise this SEMA to 0 BEFORE we start the SEND 
thread. This means it will wait until we tell it it's 
safe to go ahead * / 


sema_init (&parameters are ready, 0); 
/* Now start the other two threads */ 
thread create (send, 20000, 2,0,0); 
thread create (receive, 20000, 2,0,0); 
Bore ,;) { 
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/* Initialize s matrix for detecting packets not */ 
/* returned for fault-tolerant system. */ 
for (1 =O? i < 10; Gi —see 
for (j = 0; j3 < 10; j += 1) { 
s{i)({j].returned = 0; 


} 
no updated = 0; /* Initialized for the */ 
/* fault-tolerant system. */ 


/* This will ensure that no other threads are using 
the C run-time library (in fact, in this case they 
won't be, but I have done it here as an example...) 


uA 
sema_ wait (&par_sema) ; 
printf ("\nInput xX coordinate: "); scant “(3 emae 


&xX coord); 


/* 
+h 


printf ("Input Y coordinate: "); scanf ("%f", &y Coomame 
Printt (input yeeange: "); scanf ("*f", &range); 
gap = range / (float) (CGA _YMAX+1) ; 

y coord = y coord + range; 


printf ("Threshold 1: "); scanf ("%d", &threshigg 
printf ("Threshold 2: “); scanf ("sd", &thresh 
printf ("Threshold 3: "); scanf (“%d", &threshoaye 
getchar (); /* Consume the final NL *#/ 


/* We have finished with the C RTL - release it */ 
sema_ Signal (&par_sema) ; 


/* Into graphics (CGA low resolution) mode */ 
video mode (CGA_LORES GRAPHICS MODE) ; 


/* Before we set SEND going, reset the count of finished 
packets to zero - RECEIVE will count it back up 
a7 

tally done = 0; 

/* All ready - set it going! */ 

sema_Signal (&parameters_ are ready) ; 


previous tally = 0; 


Until all the packets have been done, just keep updating 
the display when necessary 


while (tally done < PACKETS) { 
while (tally done==previous tally) { 
/* Wait here till something happens. Use 
thread _deschedule to save cpu time 


a 
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thread deschedule (); 


} 
/* Send the picture up to the PC's display memory */ 
cga_ update (); 
previous tally = tally done; 
} 


/* This ensures that the PC display is up-to date. */ 
cga_update (); 


/* Once again, wait for the RTL to be free; then beep and 
wait till the user strikes any key 
uA 
sema wait(&par_sema) ; 
putchar ('\007'); 
getchar (); 
sema signal (&par_sema) ; 


/* Clear the screen and set text mode again */ 
video mode (MONO 80COL TEXT MODE) ; 


/* Print out items monitoring fault-tolerant system. */ 


jOheal guess 
jnealiqhese 
BeIlIncel 
pYrincetl 
oha@al)@ ese 
pr inet 
JSpgilighese 
jonesligrers 
jSieme ueid 
pied cis 
Peintt 


("Number updated = 
(isa no updated) ; 
CUNT ) 5 
("Last 1 = "); 
(esate last. 1)"; 

(" Last 5 = "); 
("Sa", last 5); 
(CUES 8 ae 

("Resend = "); 
("Sda",resend) ; 


(UN ) : 


Bs) yr 


/* Wait for all resent packets to return or */ 
/* if failure occured during processing, wait 2 min. */ 


/* 
/* 


io OF 


This insures all packets resent by fault-tolerant */ 
system have returned. */ 


while ((tally done < (PACKETS + resend)) && (i < 120)) 


timer delay (D); 
i += i> 
thread deschedule (); 
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APPENDIX E 


ORIGINAL CODE 
MANDELM.C° 


Copyright (c) 1988 3L Ltd 
Example program: Mandelbrot set evaluation and display. 


NB: This application requires a Colour Graphics 
Adaptor. 


The application 


The application consists of two tasks: 


(1) MANDELM (this file). This is the master task, and 
runs in the root transputer. 

(2) MANDELW. This is the worker task, and runs in 
all the other transputers of the net. 


The flood configurer, FCONFIG, can be used to produce 
an executable file which will automatically distribute 
the worker tasks across an arbitrary network and route 
work packets from the master to the workers. 


It is also possible to run the application ina single 
transputer. This will work automatically if the 
application is configured using FCONFIG. Alternatively, 
a static single-transputer configuration could be built 
by hand, using CONFIG. A suitable configuration file 
may be found in MANDEL.CFG. 


As well as various routines from the Parallel C 
run-time library, MANDELM must be linked with the CGA 
primitives module, CGA.BIN. A file MANDELM.LNK is 
supplied, which may be used to link MANDELM, like this: 


LINKT @MANDELM.LNK,MANDELM. B4 


Functions of the tasks 


MANDEIM is told by the user which part of the 


-Reprinted with permission. 
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xe / 


MANDELM is told by the user which part of the 
Mandelbrot set to evaluate. It then breaks this up into 
100 packets, and sends them to the network of 
MANDELW's. As the results from each return, they 

are displayed on the PC's screen. 


Internals of MANDELM 


The task contains three threads. 


(1) The MAIN thread. 

This runs in the function main(). It intialises the 
other two threads and then goes into a loop, once round 
for each Mandelbrot display. For each, it gets 
instructions from the user, and then signals the SEND 
thread to start work by using the parameters are ready 
semaphore. It keeps track of completed work by 
examining tally done, which is incremented by RECEIVE 
everytime a RESULTS packet is displayed; when- 

ever it notices that tally done has changed, it updates 
the PC's display; and when tally done reaches 100, 
MAIN knows that the display is complete. 


(2) The SEND thread. 

This Knows when to start work by examining the 
parameters are ready semaphore. It then breaks the job 
into 100 small jobs, places the details into a COMMAND 
structure (defined in file MANDEL.H) and uses the 

net send function to send it off to the network of 
MANDELW's. Notice the SEND does not specify WHICH 
worker task is to do any particular job; this is 
decided by the network of router tasks. 


(3) The RECEIVE thread. 

This simply waits till a packet arrives from the 
network of MANDELW's and then displays it. Each packet 
contains all the necessary information to display it, 
SO RECEIVE does not need to keep track of which packet 
is which. Every time it does a display, RECEIVE 
increments tally done, so that MAIN can tell when the 
whole display is complete. 


Ver. 1.1 16-Dec-87 JF 


#include <stdio.h> 
#include <dos.h> 
#include <thread.h> 
#include <sema.h> 
#include <par.h> 
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#includemega in! 
#include "mandel.h" 


/* Interface to SEND thread */ 
static SEMA parameters are ready; 


/* Interface to RECEIVE thread */ 
static int tally done; 


/* Current Mandelbrot and display parameters */ 
static float x_coord, y coord, gap; 
static int threshl1, thresh2, thresh3; 


/* 


Define the way the job is broken into packets */ 


#define X INCREMENT ((CGA_LORES_XMAX+1)/10) 
#define Y INCREMENT ((CGA YMAX+1) /10) 
#define PACKETS 100 


/* 


* This function is invoked by MAIN using thread create to 


* 
* 


create the SEND thread. 


ah 
send () 
nt oe ve 
COMMAND C; 
fOr Mey t 


/* Wait here until MAIN signals it's okay to go ahead */ 
sema_ wait (&parameters are ready); 


/* Fill in the fixed parts of the command */ 
c.x_ coord = x_coord; 
c.y coord = y coord; 


CcC.gap = gap; 


/* Send off the packets to be done. Each includes the 
coordinates of the top-left and bottom-right corners 
of the area to do. This both tells the worker task 
what values to generate and identifies the RESULTS 
packet when it arrives in the RECEIVE thread (since 
there's no guarantee that the results will arrive in 
the same order the commands are sent out) 


WA | 
for (x = 0; x < CGA _LORES XMAX; x += X_INCREMENT) { 
c.tlx = x; c.brx = x + X_INCREMENT - 1; 
for (y = 0; y <= CGA _YMAX; y += Y_INCREMENT) { 
c.tly = y; c.bry = y + Y_INCREMENT -— 1; 
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/* Send off the next packet */ 
net send (sizeof(COMMAND), &c, 1); 


* This function is invoked by MAIN using thread create to 
* create the RECEIVE thread. 


oy 


receive () 


RESULTS 1 aes 
amit lenyeseady, XV e,n, colour; 
mor (7) { 


/* Thread will wait here till a packet arrives */ 
len=net receive (&r, &ready) ; 
1 = 0; 


/* The results packet includes the coordinates of the 
top-left and bottom-right corners of the data, so we 
know where to display it. 

* 

for (y=r.tly; y<=r.bry; ytt+) { 

On uex—o. tix; X<=rvbrx; X++) { 
n = r.counts[i++]? 
/* Received 0 means 1; received 255 means 256 */ 
n += 1; 
/* Decide on the colour <- thresholds, and 
Gdisplay...%/ 
colour = (n>=threshl) + (n>=thresh2) + 
(n>=thresh3) ; 
egja ores plot (x;y, .colour) ; 


) 


/* Increment the tally of packets displayed */ 
tally done += 1; 


/* 
* The MAIN thread runs here 
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* 


%/: 


main () 


{ 


float range; 
int previous tally; 


/* Make sure we have text mode (and clear screen), then sign 

on */ 

video mode (MONO 80COL TEXT_MODE) ; 

printf ("\nCopyright (c) 1988 Shy Led\na woe 

printf ("Example program: Mandelbrot set evaluation and 
display \nio; 

printf ("NB: This program requires a Colour Graphics. 
Adaptor\n\n") ; 


/* Initialise this SEMA to O BEFORE we start the SEND 
thread. This means it will wait until we tell it it's 
safe to go ahead */ 


sema_ init (&parameters are ready, 0); 
/* Now start the other two threads */ 
thread create (send, 10COQ}m 27 oon, 
thread create (receive, 10000, 2,0,0); 


fOuw() { 

/* This will ensure that no other threads are using the C 
run-time library (in fact, in this case they won't be, 
but I have done it here as an example...) * / 


sema_ wait (&par_sema); 
printf ("\ninput xX coordinage an, 

scanf ("%f", &x coord) ; 
printf ("Input Y coordinate: "); scanf ("tf", &y cooname 
printf ("Inputeyssange- "); scanf ("%f", &range); 
gap = range / (float) (CGA_YMAX+1) ; 
y_coord = y coord + range; 
printf ("Threshold 1: "); scanf ("td") Sthrechipe 
printf ("Threshold 2: "); scanf ("td") séthresn2 .. 
printf ("Threshold 3: "); scanf ("sa") Gthresmae, 
getchar (); /* Consume the final NL */ 


/* We have finished with the C RTL - release it */ 
sema_ Signal (&par_sema) ; 


/* Into graphics (CGA low resolution) mode */ 
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video mode (CGA_LORES_GRAPHICS MODE) ; 


/* Before we set SEND going, reset the count of finished 
packets to zero - RECEIVE will count it back up */ 


tally done = 0; 
/* All ready - set it going! */ 
sema Signal (&parameters are _ ready) ; 


previous tally = 0; 


/* Until all the packets have been done, just keep 
updating the display when necessary * / 
while (tally done < PACKETS) { 
while (tally done==previous tally) { 
/* Wait here till something happens. Use 
thread deschedule to save cpu time * / 
thread deschedule (); 


} 

/* Send the picture up to the PC's display memory */ 
cga_update (); 

previous tally = tally done; 


/* In case tally done was updated to = PACKETS AFTER 
the last cga_update, do another one to ensure the 
PC's display 1s up-to-date */ 


cga_update (); 


/* One again, wait for the RTL to be free; then beep 
and wait till the user strikes any key * / 


sema_wait(&par_ sema) ; 
pueechar ('\007'); 
getchar (); 
sema_signal(&par_sema) ; 


/* Clear the screen and set text mode again */ 
video mode (MONO 80COL TEXT MODE) ; 
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